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ABSTRACT 



The redshift-space correlation function £ s for projected galaxy separations lh^ 1 Mpc can be 
expressed as the convolution of the real-space correlation function with the galaxy pairwise velocity 
distribution function (PVDF). An exponential PVDF yields the best fit to the £ s measured from 
galaxy samples of different redshift surveys. We show that this exponential PVDF is not merely 
a fitting function but arises from well defined gravitational processes. Two ingredients conspire to 
yield a PVDF with a nearly exponential shape: (i) the number density n(a) of systems with velocity 
dispersion a; (ii) the unrelaxed dynamical state of most galaxy systems. The former ingredient 
determines the exponential tail and the latter the central peak of the PVDF. 

We examine a third issue: the transfer of orbital kinetic energy to galaxy internal degrees of 
freedom. Although this effect is of secondary importance for the PVDF exponential shape, it is 
detectable in galaxy groups, indicating that galaxy merging is an ongoing process in the present 
Universe. 

We compare the £ s measured on non-linear scales from galaxy samples of the Center for Astro- 
physics redshift surveys with different models of the PVDF convolved with the measured real-space 
correlation function. This preliminary comparison indicates that the agreement between model and 
observations depends strongly on both the underlying cosmological model and the internal dynamics 
of galaxy systems. Neither parameter dominates. Moreover, the agreement depends sensitively on 
the accuracy of the galaxy position and velocity measurements. 

We expect that £ s will pose further constraints on the model of the Universe and will improve the 
knowledge of the dynamics of galaxy systems on very small scales if we improve (i) the galaxy coor- 
dinate determination and (ii) the measurement of relative velocities of galaxies with small projected 
separation. In fact, the redshift-space correlation function £ s depends sensitively on the internal 
pairwise velocity distribution of individual galaxy systems for projected pair separations ^ 0.5/i _1 
Mpc and relative velocities it 300 km s _1 . 

Subject Headings: Cosmology: Dark Matter - Cosmology: Theory - Galaxies: Clustering - Galaxies: 
Interaction - Gravitation 
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1. INTRODUCTION 

The pairwise velocity distribution function (PVDF) of galaxy systems has been studied since 
Geller & Peebles (1973) first used it to determine the mean mass of galaxy groups statistically. 
The PVDF assumed clear importance in cosmology when Peebles (1976) used it to determine the 
pairwise velocity dispersion ai 2 (r) of galaxy pairs separated by a projected distance r ^ lft -1 Mpc 1 
and ultimately to determine the mean mass density of the Universe. 

Davis & Peebles (1983) first computed the rcdshift-space correlation function £ s as a convolution 
of the real space two-point correlation function with the galaxy PVDF. Recently, Fisher et al. (1994b) 
and Marzke et al. (1995) used the same "convolution method" to determine £ s and the pairwise 
velocity dispersion <J\2{r) on non-linear scales. All of this work, starting from Peebles (1976), 
assumes an exponential PVDF. Bean et al. (1983), Fisher et al. (1994b) and Marzke et al. (1995) 
demonstrate quantitatively that this shape fits the observations better than other distributions. 
However, so far the exponential shape has been a fitting function without any physical justification. 

It is not clear what PVDF we should expect on non-linear scale where non-linear gravitational 
clustering erases information about the initial conditions. On linear scales (r <^ 10ft _1 Mpc), if 
the standard inflationary model is valid, we expect a Gaussian PVDF (e.g. Nusser, Dekel, & Yahil 
1995). As a 2-point distribution, the PVDF is a more powerful tool than 1-point distributions to 
determine whether the density fluctuations are Gaussian or non-Gaussian (e.g. Kofman et al. 1994; 
Catelan & Scherrer 1995). If the PVDF is indeed Gaussian on linear scales, we need a link between 
the observed exponential PVDF on non-linear scales and the expected Gaussian in the linear regime 
(Fisher 1995). 

Numerical work devotes attention mainly to the pairwise velocity dispersion a\i (r) rather than 
to the shape of the PVDF (e.g. Couchman & Carlberg 1992; Gelb & Bertschinger 1994). Efstathiou 
et al. (1988) simulate a flat Universe with scale free initial conditions; by analyzing the particle 
velocity field, they find a skewed PVDF with exponential tails but a flatter core at small relative 
velocities. Cen & Ostriker (1993) implicitly find the same result by simulating a standard CDM 
Universe including dissipative galaxy formation. Their single-galaxy peculiar velocity exponential 
distribution implies a PVDF similar to the one found by Efstathiou et al. (Marzke et al. 1995). A 
variety of CDM models (Fisher et al. 1994b) confirm this behavior. All of this work has a dynamic 
range of roughly three orders of magnitude. With an order of magnitude increase in dynamic range, 
Zurek et al. (1994) study the massive halo velocity field and find an exponential skewed PVDF 
at all projected separations between 0.5ft. -1 Mpc and 5.5ft -1 Mpc and for all relative velocities, 
indicating that the flat core of previous simulations probably arose from an inadequate treatment 
of gravitational interactions on small scales. 

All previous work, both observational and numerical, does not explain the physical origin of 
the exponential shape of the PVDF. Here we propose a simple physical argument for the observed 
exponential PVDF for galaxy separations ^ 1ft. -1 Mpc. 

If n{a) is the number density of galaxy systems with velocity dispersion a, we show that the 
exponential tail can be obtained from the integral of Gaussian internal velocity distributions for 
each galaxy system weighted either with the observed n(a) or with the n(a) predicted by the Press 
& Schechter (1974) theory (Sect. 2). 

Sect. 3 shows that the central peak of the PVDF requires the presence of unrelaxed systems 

1 Hq = 100ft km s -1 Mpc -1 is the present Hubble constant and we use ft = 0.5 throughout. 



3 



with a non-Gaussian internal velocity distribution. In Sect. 4 we examine a further process which 
can peak up the PVDF at small relative velocities: the transfer of orbital kinetic energy to galaxy 
internal degrees of freedom. In Sect. 5 we compare various models of the PVDF with the redshift- 
space correlation function measured for the Center for Astrophysics (CfA) magnitude limited rcdshift 
surveys. 

2. THE PVDF FROM n{a) 

Suppose that the probability of measuring one component u of the relative velocity of two 
galaxies within a particular system is a universal function A(u, a), where a is the velocity dispersion 
of the system and u is independent of the galaxy separation distance. Assume that n(er) is the 
number density of systems with dispersion a. Moreover, assume that the number of galaxies v 
within a system with dispersion a depends only on a: v = v(a). The probability of choosing 
a single galaxy is n{a)v(a) and the probability of picking a galaxy pair within a single system is 
n 2 \a)v 2 (a) I 'n(a) . Assume, for the sake of simplicity, that all the systems are disjoint with separation 
> Ih^ 1 Mpc. Thus the contribution to the pairwise velocity distribution p(u) for galaxy separation 
^ lh~ x Mpc comes only from galaxy pairs in the same system. Therefore, we can neglect the relative 
velocities of systems. Then we have 

K u .°'min,0'max)dM oc du I v 2 {a)n(a) A(u, a) da. (2.1) 

J CTmin 

Hereafter, we refer to equation (2.1) as the PVDF. Let us now make a few hypotheses which 
roughly approximate the internal properties of galaxy systems. Let us assume that systems have 
relaxed violently (Lynden-Bell 1967; Shu 1978). We can then assume that systems approximate 
truncated singular isothermal spheres with density profile p(r) = a 2 /2irGr 2 . TV-body simulations 
(Crone, Evrard, & Richstone 1994; Carlberg 1994; Navarro, Frenk, & White 1996; Cole & Lacey 
1996) show that this profile is not correct at very small and very large radii. However, the slope 
r~ 2 fits the dark halo density profile at least over the range 0.1 ^ r/r v i r 1 where r v i r is the radius 
containing an overdensity of 200 (Navarro et al. 1996). We arc interested in the relation between 
the galaxy number v and the velocity dispersion a; thus the assumption v{a) oc a 2 is reasonable. 
The isothermal model has a Gaussian velocity distribution. Thus 

K{u,a)du = {4 J n)1/2 exp (-^j du. (2.2) 

We have two choices for the number density n(a): (1) we can assume the distribution derived 
from the Press & Schechter (1974) theory which approximates the number density of massive halos 
in iV-body simulations of a flat Universe with scale free initial conditions (e.g. Efstathiou et al. 
1988; Lacey & Cole 1994); (2) we can use the observed distribution. 

The Press-Schcchtcr n(a) can be easily derived for a flat CDM universe dominated by dissi- 
pationless dark matter. Following White & Frenk (1991), consider spherical perturbations with 
comoving radius ro which have already collapsed into isothermal spheres by rcdshift z. For singular 
isothermal halos the velocity dispersion is a 2 = GM (r)/2r, independent of radius r. If the halo mass 
is M — 47rpoH)/3 where p is the present density of the Universe, the velocity dispersion can be 
expressed in terms of the redshift and the initial size of the perturbation: a — 1.68(l + z) 1 ^ 2 -ffo r o/v / 2 
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where Ho is the present Hubble constant (however, see e.g. Jing & Fang 1994; or Crone & Geller 
1995 for mass-dispersion relations when the isothermal sphere approximation is not valid). The 
number of halos with dispersion a per comoving volume at redshift z is then 

3(1.68) 3 ff 3 (l + z) 3 / 2 dlnA „ a/2 , . , 

n(a)da = / °.\ , ; ve~ v /2 da 2.3a 

V ; (47r) 3 / 2 cr 4 diner V ; 

where v = 5 C (1 + z)/ A and S c = 1.69 is the mean linear interior mass overdensity when each spherical 
shell recollapses to the origin (Narayan & White 1988). The rms linear mass overdensity in a sphere 
of radius ro is 

A(r ) = 16.3cr 8 (l - 0.39097-S 1 + 0.4814rEJ- 2 )- 10 (2.36) 

where erf is the usual ratio of the variances of the mass and galaxy fluctuations within randomly 
placed spheres of radius 8/i _1 Mpc. Equation (2.3b) approximates the correct A(r ) to within 
10% over the range 0.03/i _1 Mpc < r < 20/i _1 Mpc. The correct A(r ) is obtained through the 
convolution of the CDM linear power spectrum with the spherical top-hat window function of radius 
ro. The power spectrum, assuming fio = 1, h = 0.5 and a cosmic microwave background temperature 
9 = 2.7 °K, is (Davis et al. 1985) 

P ^ = L94 x loV «(iTw4^ Mpc3 (2 ' 3c) 

The only free parameter is now the normalization parameter erg. 

A sample of 25 Abell clusters with velocity dispersion a > 300 km s^ 1 and 31 galaxy groups in 
the CfA redshift surveys with a > 100 km s" 1 (Zabludoff et al. 1993b) yields 

n(a)da cx 10 ac7 dcr (2.4) 

where a — 0.0015. Equation (2.4) holds for a > 700 km s _1 . Mazure et al. (1996) analyze a 
volume-limited sample of 128 Abell clusters with richness R > 1. They find a similar n(a) with 
a ~ 0.0016 for a > 800 km s _1 . In both cases, the distribution is shallower for smaller a. Thus, 
using equation (2.4) for the whole range of a overestimates the number of system with small velocity 
dispersion. However, such an overestimate does not affect our analysis. In fact, we shall see that we 
need even more systems with a < 700 km s _1 to obtain an exponential PVDF. 

With these assumptions the integral in equation (2.1) at z — yields the PVDFs in Fig. 1 for 
different values of (7 m i n and for as = 0.5, 1.0, and 1.5, to map the range of COBE normalizations 
for different CDM models (Bunn, Scott & White 1995). We set cr max = 1500 km s" 1 . 

If we decrease cr m i n , the Press- Schechter ra(er) includes a larger fraction of halos of small size 
and peaks up the center of the distribution. However, most halos with a ^ 150 km s^ 1 are likely to 
contain at most one galaxy as luminous as the Milky Way. Systems of galaxies with a ^ 100 km s _1 
are only a small fraction of the total number of systems predicted by the Press-Schechter theory at 
these velocity dispersions. Moreover, it is well known that the Press-Schechter n(a) for a flat CDM 
universe overestimates the observed number of single galaxy halos with a ^ 100 km s^ 1 (see e.g. 
White 1993). If we set a m i n ~ 100 km s _1 as the lower limit of integration in equation (2.1), we 
mostly exclude halos containing only a single galaxy. 
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Figure 1 The PVDF computed through equation (2.1) with a Gaussian internal pairwise velocity distribu- 
tion A(u, a). The number density of galaxy systems is the observed n(a) (eq. [2.4]) or the Press-Schechter 
n(a) for a fiat CDM universe (ecjs. [2.3]) with different normalization eg. We take (Jmax — 

1500 km s" 1 . 

At large relative velocities u, from top to bottom, the curves in each panel have a m i n = 500, 400, 300, 200, 
and 100 km s , respectively. 



The observed n(a) does not include individual galaxies by definition and does not overestimate 
the number of galaxy systems with small a. Therefore, the PVDF does not change appreciably for 
any choice of 

^min ■ 

Fig. 1 shows that for reasonable values of (T min <; 100 km s^ 1 , equation (2.1) predicts a PVDF 
which is almost exponential at large relative velocities u. However, the PVDF bends over at smaller 
u. In order to obtain an exponential core we need a different model for A(u, a). 

3. A(u,a) OF UNRELAXED SYSTEMS 

It is well known that steady-state self-gravitating systems cannot have exactly Gaussian velocity 
distributions because escaping stars deplete the high- velocity tails (e.g. King 1965, 1966). In fact, the 
isothermal sphere is the only self-gravitating system with a Gaussian velocity distribution. However, 
its mass is infinite and real steady state systems tend only asymptotically to the Gaussian distribution 
(e.g. Padmanabhan 1990). 

We should not expect a Gaussian distribution in galaxy systems for another reason: many 
observed galaxy systems - from groups to clusters - are unlikely to be relaxed. Most galaxy groups 
are still collapsing (e.g. Diaferio et al. 1993; Doe et al. 1995) and many clusters contain substructures 
which indicate that they are far from equilibrium (e.g. West, Jones, & Forman 1995; Colless & 
Dunn 1996). Therefore, a single Gaussian is not a good approximation to their velocity distribution. 
Velocity distributions will depend on the initial conditions and on the dynamical state of the system. 
In general, there will not be a universal A(u, a) for all galaxy systems. 

These arguments apparently show that the assumptions about the shape and the uniqueness of 
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the distribution A(u) in Sect. 2 are inadequate. However, we can investigate what shape A(u) tends 
to assume in a hierarchical clustering scenario, where either systems are virialized or they are still 
forming through the aggregation of virialized subunits. 

To model the evolution of a cluster by accretion of subunits, the excursion set formalism (see 
e.g. Bower 1991; Bond et al. 1991; Kauffmann & White 1993; Lacey & Cole 1993) extends the 
Press-Schechter formalism to estimate the number density of halos with mass Mi at redshift z\ 
which will merge at different times to form a single halo of mass M 2 > Mi at redshift z 2 < zi . We 
have 



n(M 1 ,z 1 \M 2 ,z 2 )dM 1 = 



1/2 

Po 
Mi 



din Ax 



x exp 



dM 1 
{Si - 5 2 ) 2 
"2(A?-A|) 



(A?-A|)3/2 

dMi (3.1a) 



where po is the present density of the Universe, A? (i = 1, 2) is the variance of the linear over- 
density in a sphere containing the mass Mj, and Si = 1.69/D(zi) is the extrapolated linear critical 
overdensity for which perturbations with overdensity S > Si have collapsed at redshift Zj. D{zi) is 
the perturbation growth factor. 

Following the procedure used to obtain the number density of halos with velocity dispersion a 
(eq. [2.3a]), we can use equation (3.1a) to express the number density of halos with velocity dispersion 
<Ti at redshift zi which will form a halo with dispersion <j 2 at redshift z 2 < z\. In a flat CDM 
universe, D{zi) = (1 + Mass and dispersion of each halo arc related by Mi = 47r / 9o[f'o 4 ' ) ] 3 /3 

and <jj = 1.68(1 + Zi) 1 / 2 !!^^' /V2- Equation (3.1a) becomes 

f 1 w 3(1.68) 3 g 3 (l + zi) 3 / 2 dlnAi A? . _- a/2 
n(ai, Zl \a 2 ,z 2 )dai = , /l _ N , /9 _ 4 T2 T2 ve d<J i ( 3 - 15 ) 



(47r) 3 /Vf dlnaiAf-A 



where v = 1.69(zi — Z2)/(Ai — A^) 1 ' 2 and we express Aj with the approximation of equation (2.3b). 

Let us now suppose that we observe a galaxy system at redshift z\ = z which has not yet 
collapsed, but it still contains different substructures which will merge to form a single halo with 
velocity dispersion a 2 — cr max at a later epoch, e.g. z 2 = 0. We want to compute the probability A(w) 
of measuring a velocity difference u between two galaxies within this collapsing system at redshift 
z. 

If we assume that each subunit has a velocity distribution ip(v,a), the probability of choosing 
a galaxy with velocity v within the system is 

a(v,a min ,cr niax ,z)dv cx dv I n(a, z|(T max , 0)u(a)tl>(v, o)do (3.2a) 

-'a' m i n 

where n(cr, z|cr max , 0) is given by equation (3.1b) and v{a) is the number of galaxies within each 
substructure as in Sect. 2. For the sake of simplicity, equation (3.2a) ignores the relative velocities 
of the subunits. The inclusion of this effect will probably broaden the velocity distribution a. Thus, 
equation (3.2a) is conservative with respect to our purpose of investigating the departure of a from 
a Gaussian. Here however, we limit our analysis to the simplest case. 

We notice that equation (3.2a) is a velocity distribution decomposition if we look at it the other 
way around. In other words, we "build" the velocity distribution instead of decomposing it. For 
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Figure 2 Pairwise velocity difference distribution A(u) for a single unrelaxed galaxy system (eq. [3.2b]) at 
different redshift z and with different normalization cr 8 . We take <J m i n = 100 km s _1 . At large relative 
velocities u, from top to bottom, the curves in each panel have a max = 1500, 1300, 1100, 900, 700, 500, 
and 300 km s _1 , respectively. 

example, van der Marel & Franx (1993) decompose line profiles of elliptical galaxies in orthogonal 
Gauss-Hermite functions to quantify departures from Gaussian line profiles. Zabludoff, Franx, & 
Gellcr (1993a) apply a discrete version of this technique to the velocity distributions of eight rich 
Abell clusters. Here, we point out that the hierarchical clustering scenario naturally predicts that 
a(v) is a sum of elementary distributions. 

Equation (3.2a) quantifies the degree of subclustering in a galaxy system. Comparison of 
equation (3.2a) with velocity distributions of real clusters may provide constraints on the density of 
the Universe (e.g. Evrard et al. 1993, Jing et al. 1995). Moreover, extensions of equation (3.2a) can 
determine the rate of growth of clusters as a function of redshift (Lacey & Cole 1993). 

Now we can write the pairwise velocity distribution A(u) as 

A(M,cr m in, c max , z)du <xdu J a(vi)a(v 2 )5(\vi - v 2 \ - u)dvidv 2 

=du J a{v\)a{vi + u)dv\. (3.26) 

Let us assume that the subunits are virialized and that each approximates an isothermal sphere. 
Therefore, ip(v, a) is Gaussian and v{o) oc a 2 as in Sect. 2. Integration of equation (3.2b) yields 
the curves in Fig. 2 for different values of <7 max , i.e. for different masses of the final dark halo, and 
for <7 m i n = 100 km s _1 . Fig. 2 shows that as we decrease the redshift z, i.e. as we come closer to 
the formation of the final system, A(u) approaches a Gaussian, as expected. At high redshift the 
system is far from equilibrium and A(u) is more centrally peaked. 

Is the presence of substructure the only physical process responsible for more centrally peaked 
A(it)'s? Formally, the presence of substructures implies that A(u) is a weighted integral of elementary 
distributions (eqs. [3.2]). This assumption is common to other fields: weighted integrals of Gaussian 
distributions are also invoked to explain the exponential shape of molecular cloud emission lines 
(e.g. Ida & Taguchi 1996; but see also Miesch & Scalo 1995) or the small scale velocity gradient 
distribution in turbulent flows (Castaing, Gagne, & Hopfinger 1990; see also She 1991). 
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However, there is a very simple example in gravitational dynamics where a centrally peaked 
A(u) docs not originate from a weighted integral. Consider a collapsed region subject to secondary 
infall (Gunn & Gott 1972): infalling galaxies have small relative velocities and peak up the center 
of the distribution whereas galaxies in the virialized region populate the exponential tails. 

An TV-body toy model illustrates this issue. Consider an isolated mass sphere with initial radial 
density p(r) = (M/4irR 3 )(R/r) 2 , where R is the radius and M the total mass of the sphere. If 
each shell is initially expanding according to the Hubble flow f(0) = HqVq, the maximum expansion 
radius is r max = r exp(i?) at time i max = H^ 1 ^JnB cxp(£?)P(l/2, B), where B = H^rlR/2GM 
and P(a, x) is the incomplete gamma function. 

We choose this density profile for two reasons: (1) the difference between the time and the 
radius at maximum expansion of two different shells grows exponentially and distinguishes the infall 
and the virialized regions clearly for our illustrative purpose; (2) the virialized region has the quasi- 
equilibrium density profile r~ 2 . Thus, at the same time, we minimize the relaxation process and 
isolate the effect due to the infall region. 

We follow the evolution of an isolated spherical system with N — 4096 particles initially expand- 
ing with the Hubble flow. In numerical units, the gravitational constant is G = 1, the system has total 
mass M = 1 and radius R = 1, and the initial Hubble constant is H = 1.2. We use the TREECODE 
by Hernquist (1987) with softening parameter s = 0.02i? and tolerance parameter 9 — 0.8. We inte- 
grate the particle equations of motion for two collapse times t c = 27r(3/10) 3 / 2 GM 5 / 2 /|i?| 3 / 2 , where 
E is the total energy of the system. The integration time step is At = 1 0~ 3 t c . 

Fig. 3a shows the evolution of the distribution of the velocity component v z . We show three 
distributions at each time: the total distribution (solid histogram), the virialized region distribution 
(bold histogram) and the infall region distribution (dashed histogram). At each time t, particles 
with 4t max < t belong to the virialized region and particles with 4t max > t belong to the infall 
region. We superimpose the best Gaussian fit on the virialized region distribution to show that 
virialization indeed took place in the central region. We choose the time limit 4i max > 2i max to 
suppress oscillation effects. Fig. 3a clearly shows that the infall region is responsible for the central 
peak and the virialized region is responsible for the tails of the total distribution. Fig. 3b show that 
the total distributions in Fig. 3a yield nearly exponential A(w)'s. 

We also ran a simulation similar to the one above but with an initial density profile p(r) = 
(M/2nR 3 )(R/r) which yields r max = H^/2B + r and t max = H r /B where B = GM/R 2 . The 
system relaxes faster than with an initial r~ 2 density profile and secondary infall lasts for only a 
small fraction of the collapse time (Fig. 3c). However, when secondary infall involves a large mass 
fraction of the sphere, A(u) is exponential (Fig. 3d). 

Velocity distributions like those in Fig. 3a are difficult to observe in individual real systems. 
Infall regions where galaxies have small velocities relative to the center of mass of the cluster are 
close to the turnaround radius which is ^ l/i" 1 Mpc for a typical Abell cluster with mass ~ 10 14 /i _1 
M Q , i.e. richness R = 1. Those regions are contaminated by foreground and background objects and 
they are generally poorly sampled. Therefore, when systems are observed individually, observational 
biases imply that departures from Gaussian velocity distributions in observed clusters are more likely 
due to subclustering rather than to infall region effects. However, both effects are present in redshift 
surveys where large regions of the Universe are sampled. 

The preceding discussion shows that unrelaxed systems imply that A(w)'s differ from Gaussians. 
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Figure 3a Evolution of the distribution of the velocity components v z for an isolated spherical system of 
particles initially expanding with the Hubble flow and with (a) an initial r~ 2 density profile or (c) an initial 
r _1 density profile. Times are in units of the collapse time t c . The solid histogram is the total distribution, 
the bold histogram is the distribution for particles within the virialized core and the dashed histogram for 
particles within the infall region. The curves are the best Gaussian fits to the virialized core distributions. 
Panels (b) and (d) show the evolution of the pairwise velocity difference distribution A(u) for the system 
with initial r~ 2 or r~ 1 density profiles, respectively. 




Figure 3b 

The A(w)'s for unrelaxed systems tend to have a more pronounced central peak than Gaussian 
distributions. A combination of substructures and infall regions contribute to this shape. 

It is now clear that a unique and universal A(u, a) does not exist, but rather each system has a 
distribution depending on its particular dynamical state. However, for the sake of investigation, we 
persist in the assumption of a universal A(u) and we examine the way the PVDF p(u) in equation 
(2.1) changes when A(u, a) differs from a Gaussian. Figs. 2 and 3 suggest that systems far from 
equilibrium may have a distribution similar to an exponential 

A(u, a)du= — = cxp( — ^\ u \ j d u (3 3) 

ay/2 y cr J 

With an exponential A(u) the integral in equation (2.1) yields the curves in Fig. 4. We clearly 
see that the central peak is more pronounced than in the Gaussian case (Fig. 1) yielding a better 
approximation to an "exponential" shape for the PVDF. 
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Figure 3c 




We stress here that the results of Figs. 1 and 4 are based on the generic assumption that all 
galaxy systems have the same internal pairwise velocity distribution A(u, a). However, the use of a 
unique A(it, a) is still meaningful if we interpret A(u, a) as a convolution of different internal pairwise 
velocity distributions of individual systems at different dynamical states but with the same a. Thus, 
the results of Figs. 1 and 4 allow us to conclude that if the real PVDF is indeed exponential over a 
wide range of relative velocities u, the blend of different internal pairwise velocity distributions must 
contain a large fraction of distributions more centrally peaked than a Gaussian. In other words, 
the exponential shape of the PVDF is a signature of the presence of a large fraction of unrelaxed 
systems. 

4. A(u, a) OF DISSIPATIVE SYSTEMS 

In Sect. 3 we show that unrelaxed systems have internal pairwise velocity distribution A(u,a) 
more centrally peaked than a Gaussian. This shape arises from secondary infall and the presence of 
substructures. Here we investigate a third physical process which can peak up the A(u) distribution 
at small relative velocities: the transfer of orbital kinetic energy to galaxy internal degrees of freedom. 
In fact, this effect has only a secondary impact on the velocity distribution. 

To examine this issue, one should solve the complete Boltzmann equation for the evolution of 
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Figure 4 Same as Fig. 1, but with an exponential A(u, <r) (eq. [3.3]). 

the phase space density of the galaxy system including a collisional term. Fusco-Femiano & Mcnci 
(1995) study how the velocity distribution evolves in the presence of binary mergers in an external 
gravitational potential. In other words, they compute the decrease of orbital kinetic energy when 
binaries disappear by merging. Here, we want to include the orbital kinetic energy loss due to tidal 
perturbations, thus computing the energy loss when mergings do not take place. 

We assume a simple physical model to derive an analytic expression for the expected A(u). 
Assume an initially stable self-gravitating gas of particles. Imagine switching on the particle internal 
degrees of freedom at a time to. Now, during the motion of particles within the system, tidal effects 
increase the particle internal energy at the expense of the orbital kinetic energy of the particles. 
This system is apparent unstable, tending ultimately to a general merging of 'hot' particles. This 
process indeed occurs in galaxy groups (see e.g. Mamon 1992b; Diaferio et al. 1993; Doe et al. 1995; 
Weil & Hernquist 1996). 

The most serious shortcoming of this model is that we assume a constant particle mass, clearly 
incorrect because merging and tidal stripping are ongoing processes. Both processes increase the 
kinetic energy loss. When galaxies merge, a binary system disappears and its relative kinetic energy 
is completely transferred to the internal energy of the remnant. Tidally stripped matter forms a 
common background envelope. Particle cores also lose kinetic energy through dynamical friction 
against this background. Therefore, the main consequence of ignoring mass loss is to underestimate 
the kinetic energy loss. 

However, with these hypotheses and the assumption that the unperturbed velocity distribution 
is Gaussian, we derive the perturbed distribution (see the Appendix) 



A(u, a)du — C(a, a, ip) exp 



1 - aH 



aV2 



du 



(4.1a) 



where C is the normalization constant and the function H can be expressed in terms of the modified 
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Bessel functions. In addition to the velocity dispersion a, this distribution depends on two param- 
eters a and ip. If we identify particles with galaxies, both a and ip contain information about the 
similarity of the galaxy internal dynamics to the galaxy system dynamics. In fact, 

<p oc (4.1o) 

a r c 

and 

»«(7)s (4 - lc) 

where Uj is the velocity dispersion of the stars within an individual galaxy, r c and R s are the galaxy 
and the system size, respectively, and p m ; n is the maximum separation at which two interacting 
galaxies are considered a merger remnant. 

In order to test the hypotheses leading to equations (4.1), we follow the evolution of a King 
sphere (King 1966) with Hernquist's (1987) ./V-body code. King spheres have a truncated Gaus- 
sian velocity distribution and are stable self-gravitating systems. Therefore, they approximate the 
boundary conditions of our physical model. 

We sample two King spheres with central gravitational potential 4>(0)/<r 2 = —12. The system 
sphere contains N g — 50 particles; the galaxy sphere contains N s — 100 particles. We evolve each 
sphere in isolation for 4.8 collapse times t c . We set the tolerance parameter 9 = 0.8 and the time 
step At = 10~ 3 t c . A softening parameter e = 0.1r t , where r t is the tidal radius of the sphere, 
insures suppression of two-body relaxation effects. As expected, the spheres are dynamically stable 
and their velocity distributions remain remarkably Gaussian for the entire integration. 

We then replace each particle of the final system sphere with a 100 particle final galaxy sphere. 
In other words, the 50 single particles become resolved "galaxies" containing 100 particles each. 
We opportunely rescale particle velocities and relative positions to ensure dynamical equilibrium 
and to suppress two-body relaxation within each galaxy. This procedure is equivalent to switching 
on the particle internal degrees of freedom. We evolve the system for 2At c . Simulation units are 
G = M = R = 1, where G is the gravitational constant, M the total mass and R the radius of 
the system. At each time, we identify galaxies from particle positions through a generalization of 
the friends-of-friends algorithm (Diaferio, Geller, & Ramella 1994). After 2At c the galaxy number 
usually decreases from 50 to ~ 30. 

We ran five simulations with different random number seeds. Fig. 5 shows the time evolution 
of the distribution \{u) of the galaxy pairwise velocity difference moduli. Differences among the 
distributions of the five simulations arise from statistical effects only. Thus, in Fig. 5 we suppress 
statistical noise by summing the distributions of the five simulations at each time. Two fits are 
superimposed: the bold curve is the perturbed Maxwellian distribution given in the Appendix (eqs. 
[A.12]) 

\(u)du — C(<7, a, p)u 2 exp ( — ^— ^ J 1 — aH ( -j= , ip\ du (4.2) 



and the solid curve is the Maxwellian distribution, i.e. equation (4.2) when a = 0. 

In the perturbed Maxwellian we set tp = 1.22 according to equation (A. 4); a and a are free 
parameters. By adding ip as a free parameter the fits do not change significantly and ip remains 
in the range 1.00 -j- 1.40. Therefore, the perturbed Maxwellian only has a as an additional free 
parameter compared with the Maxwellian distribution. Fig. 5 shows that the perturbed Maxwellian 
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Figure 5 Time evolution of the distribution of pairwise velocity difference moduli for systems of particles 
with internal degrees of freedom. The distribution is initially Maxwellian but at later times the frequency 
of small relative velocities increases. The best Maxwellian fits (solid lines) and the equation (4.2) fits (bold 
lines) are shown. The latter are slightly better than the former as shown by the ratios of the X 2 ' s - in 
simulation units, the Maxwellians have fit parameters \[2u = 1.43, 1.42, 1.39, and 1.41 at t/t c — 0.0, 0.8, 

1.6, and 2.4, respectively. The perturbed Maxwellians have fit parameters \/2a = 1.44, 1.45, 1.41, and 1.44 
and a — 0.01, 0.13, 0.14, and 0.15, respectively. 



describes the increased frequency of small relative velocities. However, the frequency increase is 
barely detectable, despite the fact that typically Uija ~ 0.9 (eq. [4.1c]); i.e. the ratio of the velocity 
dispersions is not negligible. The ratio of the x 2 's of the two distributions shows that the perturbed 
Maxwellian fits the numerical distribution only slightly better than the Maxwellian distribution. 

At earlier times (t S5 2t c ) most galaxies have not yet merged and their masses have not been 
reduced significantly by tidal stripping. Therefore, the physical model outlined in the Appendix is 
approximately valid. When we follow the system evolution for t > 2At c we find that A (it) does not 
usually tend to depart farther from the unperturbed distribution. At these later times however, 
comparison of equation (4.2) with the numerical distributions is meaningless. Mergers create one 
large merger remnant surrounded by galaxies with masses 0.1 times the dominant galaxy mass. 
Thus, the system no longer contains galaxies similar in mass and our simple physical assumptions 
break down. 

Thus, Fig. 5 confirms that A(it, er) in equation (4.1a) should be valid for galaxy systems con- 
taining galaxies of similar mass. However, if A (it, a) is Gaussian in the absence of galaxy internal 
degrees of freedom we expect that small departures from a Gaussian distribution will arise from the 
transfer of energy to the internal degrees of freedom. 

In order to test equation (4.1a) against A(u)'s of real systems, we compare equation (4.1a) 
with Hickson's compact groups (Hickson 1993). These systems are the densest in the Universe 
(<~ 10 5 galaxies per h 3 Mpc~ 3 ) if they arc not two dimensional projections of unrelated galaxies 
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Figure 6 Galaxy pairwise velocity difference distribution for the 69 Hickson compact groups with N > 4 
members. The solid line is the best fit Gaussian and the bold line is the best fit perturbed Gaussian (eq. 
[4.1a]). The Gaussian has fit parameter \[2a = 391 km s _1 . Equation (4.1a) has fit parameters \f2a = 487 
km s _1 and a = 0.69. The dashed histogram is the pairwise velocity difference distribution for the compact 
groups with N > 4 members in the simulated catalog of Diaferio et al. (1995). 

(Mamon 1992a; Hernquist, Katz, & Weinberg 1995). Therefore, we expect that the kinetic energy 
loss effects might be detectable in these extreme systems. Moreover, Hickson's brightness selection 
criterion requires that galaxy members lie within an interval of three magnitudes, assuring that 
galaxy members are not very different in mass. 

Fig. 6 shows A(u) for the 69 Hickson (1993) compact groups with N > 4 galaxies. We sum 
all the single distributions for the whole sample of 69 compact groups because we assume that each 
compact group is a sample of the same parent distribution. We base this approach on the model 
first proposed by Diaferio, Gcllcr, & Ramella (1994, 1995) that observed compact groups may be 
identified with substructures in collapsing rich loose groups. Ramella et al. (1994) search the redshift 
neighborhoods of compact groups within the CfA magnitude limited redshift surveys and confirm 
that at least 70% of compact groups are embedded in larger systems. If compact groups share this 
same origin, we may assume that the pairwise velocity distribution of each compact group is sampled 
from the same parent distribution. 

Equation (4.1a) (bold curve) clearly fits the observed distribution better than a Gaussian dis- 
tribution (solid curve). This result indicates that compact group galaxies are loosing kinetic energy 
in a way consistent with our simple physical model. Moreover, A(u, a) is only slightly perturbed 
compared with a Gaussian, indicating that the galaxies still retain most of their orbital kinetic en- 
ergy. This conclusion agrees with the hypothesis that compact groups have just collapsed and that 
most galaxies are at their first encounter within the compact group (Diaferio et al. 1994). 

Fig. 6 shows a further confirmation of the validity of this model of the formation of compact 
groups. The dashed histogram is the pairwise velocity distribution computed from the simulated 
catalog of compact groups with N > 4 members (Diaferio et al. 1995). The Kolomogorov-Smirnov 
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Figure 7 Galaxy pairwise velocity difference distribution for the central region of the Abell cluster A576. 
The solid line is the best fit Gaussian and the bold line is the best fit perturbed Gaussian (eq. [4.1a]). The 

ratio of the x 2 ' s shows that the two fits do not differ. The Gaussian has fit parameter \[2o = 1589 km 
s" 1 . Equation (4.1a) has fit parameters \[2o = 1607 km s _1 and a — 0.05. 

test shows that the observed compact group sample and the simulated sample belong to the same 
parent distribution at the 16% significance level. 

However, the difference between the two distributions is not a statistical effect: the iV-body 
model systematically underestimates the frequency of small relative velocities. This result suggests 
that the model dynamical resolution is insufficient to resolve the transfer of kinetic energy completely. 
In other words, the model tends to merge galaxies before Nature does. This result is expected: the 
model only accounts for dissipationless galaxy formation processes. Dissipative processes decrease 
the galaxy merging cross-sections and galaxies survive for a longer time against merging than in 
dissipationless TV-body simulations (e.g. Evrard, Summers, & Davis 1994; Frenk et al. 1996). 

We finally consider the pairwise velocity distribution for a galaxy cluster. The perturbed dis- 
tribution in equation (4.1a) depends on the fourth power of the ratio between the internal velocity 
dispersion of individual galaxies and the velocity dispersion of the galaxies within the cluster (eq. 
[4.1c]). Thus, we expect a nearly Gaussian A(u,a) for a massive virialized cluster. We consider 
the Abell cluster A576 (Mohr et al. 1996). The 85% complete magnitude limited sample contains 
169 galaxies lying within a projected distance r 1.5h Mpc. The cluster mass lies in the range 
~ 1t4x 10 15 h~ 1 M Q implying a turnaround radius ~ 3.0/i _1 Mpc. Therefore, the infall region 
around the turnaround radius is not sampled. Of the 169 galaxies within the central region, 58 
galaxies have spectra with line emission and 111 have no line emission. Mohr et al. (1996) identify 
these two samples with galaxies containing or not containing star formation regions, respectively. 
They also generically identify them with late-type or early-type galaxies. Mohr et al. (1996) show 
that the late-type galaxies have a velocity distribution broader than the early-type galaxies and 
identify late-type galaxies with galaxies falling into the central region for the first time. Thus, if 
we exclude these galaxies, the subsystem of early-type galaxies is in approximate virial equilibrium. 
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Mohr et al. (1996) argue that the absence of apparent substructures in the distribution of the 
early-type galaxies confirms this assumption. Fig. 7 shows the pairwise velocity distribution for the 
non-emission line galaxies only. In contrast with the compact group sample where Oijo ~ 1, here 
we have c^/cr <~ 0.3 (eq. [4.1c]). We fit the distribution with a and a. free parameters; ip = 1.22 as 
for the numerical experiment. As expected from equation (4.1c), the Gaussian and equation (4.1a) 
fit the observed distribution equally well. 

5. THE REDSHIFT-SPACE CORRELATION FUNCTION 

In the preceding sections we investigate how the exponential shape of the PVDF depends on the 
galaxy system number density n(a) and the internal galaxy pairwise velocity distribution A(it, a) 
of individual systems. We assume that all the galaxy systems have the same A(u,a). We then 
investigate how A(it, a) varies depending on the internal dynamics of each system. 

We now investigate how the redshift space correlation function £ s depends on the galaxy system 
number density n(a) and A(u,a) through the PVDF. We restrict ourselves to Gaussian and expo- 
nential A(u,er)'s. We do not discuss the A(u) distribution for dissipative systems (Sect. 5), because 
this distribution is only marginally distinguishable from a Gaussian for real systems. 

The "convolution method" (e.g. Fisher 1995) expresses the redshift-space correlation function 
6 as 

/+oo 
[1+Z(r)}p(u)dy (5.1) 
-oo 

where r p is the spatial separation of the galaxy pair projected on the sky, ir is the velocity difference 
along the line of sight, r 2 = r 2 + y 2 , y is the pair spatial separation parallel to the line of sight, 
and u — it — y is the peculiar relative velocity; 2 £(r) is the real-space correlation function and p(u) 
is the PVDF. In equation (5.1) we assume (1) that the PVDF is independent of r and (2) that 
the mean relative peculiar velocity vnir) of galaxy pairs separated by r is zero. Both assumptions 
are reasonable when O.l/i" 1 Mpc ^ r p ^ l/i -1 Mpc, where galaxy velocities are almost random 
(vi2(r) ~ 0) and the pairwise velocity dispersion au(r) ~ const (Marzke et al. 1995; Fisher 1995). 

We consider the redshift-space correlation function averaged over the projected separation r p , 
namely 

1 [ r ™^ 

^*min i ^max ; 7i")) = / £,s(r p ,n)dr p . (5.2) 

Fig. 8 shows the comparison of the measured (£s(tt)) for different intervals of projected sep- 
arations [r m j n ,r max ] with different models of the PVDF convolved with the real space correlation 
function £(r). We use £(r) = (r/r ) 7 , where r = 5.97 ± 0.15/j" 1 Mpc and 7 = -1.81 ± 0.02 as 
measured by Marzke et al. (1995) for the CfA redshift survey (CfA2) and the Southern Sky Redshift 
Survey (SSRS2) galaxy samples combined (CfA2 + SSRS2). In Fig. 8, squares are the measured 
(£ s (tt)) for this sample with [r min ,r max ] = [0.1,0.2], [0.2,0.4], [0.4,0.8], and [0.1, 1.0] /i -1 Mpc. We 
superimpose the curves computed through the integrals in equations (5.2) and (5.1), where p{u) is 
the integral in equation (2.1) with cr min = 100 km s _1 and cr max = 1500 km s^ 1 . We show the 
curves with a Gaussian internal velocity distribution A(u, a) (dashed lines) and with an exponential 
A(u, a) (solid lines). We compute the curves with both the observed n(a) (eq. [2.4]) and with the 
distribution derived from the Press-Schechter theory (eq. [2.3a]) for as — 0.5, 1.0, and 1.5. 

2 Here we assume that the velocities are in units of the present Hubble constant Hq. 
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Figure 8a Comparison of the observed redshift-space correlation function (£ s (7r)) with models of the PVDF 
convolved with the observed real space correlation function (eqs. [5.2] and [5.1]). Squares are the measures 
from the CfA2+SSRS2 redshift survey samples with [r mm ,r max ] = [0.1,0.2], [0.2,0.4], [0.4,0.8], and 
[0.1, 1.0] /i -1 Mpc respectively, as shown in each panel (Marzke et al. 1995). Curves are the PVDFs (eq. 
[2.1]) with a m i n = 100 km s _1 and a max — 1500 km s _1 . Solid (dashed) lines are computed with an 
exponential (Gaussian) internal pairwise velocity distribution A(lt, a). The number density of galaxy system 
is the observed n(a) (eq. [2.4]) or the Press-Schechter n(a) for a flat CDM universe (eq. [2.3a]) with different 
normalization a$: (a) as — 0.5; (b) as = 1.0; (c) as = 1.5; (d) observed n(a). 

cr a =1.0 




5 10 1 15 20 5 lOj 15 20 
7T / h Mpc 7T / h Mpc 



Figure 8b 
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Figure 8d 



In order to quantify the agreement between the curves and the data for each projected separation 
interval, we plot the x 2 per degrees of freedom v as a function of the upper limit r max of the interval 
(Fig. 9). We do not derive a parameter from the twenty data points within each range of r p , thus 
we assume v = 20. The x 2 's are indicative and are meaningful only if we compare them with each 
other. The data are actually correlated and the estimates of (£ s (7r)} are not normally distributed. 
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Figure 9 Reduced % 2 with v = 20 degrees of freedom for the curves in Fig. 8 as a function of the upper limit 
r max °f the integral in equation (5.2). The four upper limits correspond to the four projected separation 
intervals [r min ,r max ] = [0.1,0.2], [0.2,0.4], [0.4,0.8], and [0.1, l.Oj/i" 1 Mpc. Open (filled) squares are for 
an exponential (Gaussian) A(u, a). 

Therefore, we should use a different approach to estimate \ 2 ( see Fisher et al. 1994a; and Marzke 
et al. 1995 for further details). 

The observed n(a) approximates the data better when it weights an exponential A(u) (Fig. 9, 
open squares) rather than a Gaussian (filled squares). However, systems selected from a redshift 
survey exceed a particular density contrast threshold 5p/p. The n(a) we use thus contains only 
systems with Sp/p > 80 with respect to the background (Zabludoff et al. 1993b). Therefore, the 
observed n(a) does not contain enough systems with small density contrast, and presumably low a, 
by definition. The exponential A(u) partially compensates this underestimate and the agreement is 
better. 

In any case, the theoretical n(er) confirms that an exponential A(u) reproduces the data better 
than the Gaussian A(u), although not for all the normalizations ag. The theoretical n(a) also 
depends on the power spectrum P(k) (eq. [2.3c]) and on the density of the Universe through the 
perturbation growth factor which enters the Press-Schechter distribution function. Thus, we must 
interpret the implications of Fig. 9 about as cautiously. 

The agreement between curves and observations also depends on the projected separation inter- 
val. Marzke et al. (1995) computed (£ s (tt)} for the data with the intervals shown in Fig. 8. However, 
the galaxy coordinates in the Zwicky catalog, on which the CfA survey is based, are accurate to 
~ l-j-1.5 arcmin. At redshift cz = 10000 km s -1 , the 3cr error is thus ~ 0. 09-^-0. 14/i -1 Mpc, implying 
an error in the projected separation ~ 0.13-^0.19/i _1 Mpc. With the current data, large errors prob- 
ably contaminate the interval [r m ; n ,r max ] = [0.1, 0.2]/i _1 Mpc. Intervals with r m i n > 0.2/i _1 Mpc 
are more reliable. We also emphasize that galaxy velocities often have uncertainties ^ 50 km s _1 , 
which means typical errors ^ 70 km s _1 in the relative velocities. Thus, we regard the measures of 
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£ s at 7r = 50, 150 km s -1 with caution. 

Large differences in x 2 's between the exponential and the Gaussian A(u, a) (see for example Fig. 
9 when as = 0.5 or when as = 1.0 and [r m i n , r max ] = [0.1, 0.2] or [0.4, 0.8]/i _1 Mpc) originate mainly 
at small relative velocities tt < 200^-300 km s" 1 (Fig. 8). Thus, by improving the accuracy of galaxy 
relative positions and velocities by at least a factor of two, the redshift-space correlation function 
£ s can (1) clearly discriminate among the models, and (2) can probably separate the cosmological 
contribution represented by n(a) from the contribution of the internal dynamics of galaxy systems 
represented by A(u, a). Moreover, on very small scales (r p & O.l-f-0.2/?. - 1 Mpc, tt ^ 100-^200 km s" 1 ), 
we expect that the dissipative effects outlined in Sect. 4 will become more apparent. Thus, £ s can 
constrain the importance of mergers in the present Universe. 

The main conclusion of our analysis is that agreement between the models of the exponential 
PVDF and the observed redshift-space correlation function on non-linear scales depends strongly on 
both the underlying cosmogonic model [namely n(a)} and the internal dynamics of galaxy systems 
[i.e. A(u, a)]. Neither aspect dominates. However, reliable measures of £ s at small scales can separate 
the two contributions and provide further constraints on the model of the Universe. 

6. CONCLUSION 

Marzke et al. (1995) measured the redshift-space correlation function £ s for galaxy samples 
of the Center for Astrophysics (CfA) redshift surveys for galaxy separations l/i -1 Mpc. An 
exponential galaxy pairwise velocity distribution function (PVDF) yields the best fit. This result is 
common to other redshift surveys (e.g. Bean et al. 1983; Fisher et al. 1994b). 

We propose a physical explanation for this observed exponential shape. If all galaxies belong to 
isolated galaxy systems with velocity dispersion a, the PVDF is the weighted sum of the distributions 
A(u, a) of the pairwise velocities u within each system. The weight depends on the galaxy number 
v(a) within each system and the number density n(a) of the systems within the sample. 

We assume that A(it, a) is a universal function, identical for each system. This assumption is 
inadequate, because we show that the shape A(u, a) depends on the dynamical state of the system. 
However, if we assume that all the system are virialized, A(u) is Gaussian and v(a) oc a 2 . In this 
case, both the observed n{a) and the n(a) predicted by the Press-Schechter theory in a flat CDM 
Universe yield a nearly exponential PVDF, but only at large relative velocities u. In order to obtain 
an exponential central peak, A(u) has to be more centrally peaked than a Gaussian distribution. 
When a galaxy system is unrelaxed, substructures and infall regions contribute to a centrally peaked 
A(u). We limit our analysis to an exponential A(u, a) which yields the expected central peak of the 
PVDF. The Gaussian and the exponential distributions represent the two limiting cases. A more 
detailed analysis of the physical origin of A(u, a) is likely to yield a A(u, a) between these two cases. 
Therefore, we conclude that the observed exponential PVDF testifies to the presence of a large 
fraction of unrelaxed galaxy systems in the present-day Universe. 

A third process may increase the frequency of small relative velocities: the transfer of orbital 
kinetic energy to galaxy internal degrees of freedom. We derive an analytical A (it) which accounts for 
energy transfers driven by tidal perturbations. We predict that these perturbations are detectable 
in galaxy systems with a ratio ^ 1 between the internal velocity dispersion of individual galaxies 
and the velocity dispersion of galaxies within the system. We confirm this prediction by comparing 
the analytic distribution with A-body simulations and with observed compact groups. 



21 



Finally, we compare the measured redshift-space correlation function £ s with the convolution of 
different models of the exponential PVDF with the measured real-space correlation function. The 
agreement between models and observations depends strongly on both the underlying cosmogonic 
model and the internal velocity distribution A(u, a) of galaxy systems. These two effects are of 
comparable importance over the entire range of relative velocities tt < 2000 km s _1 . 

We expect to be able to disentangle the two effects with more accurate galaxy coordinate 
and relative velocity measurements. In fact, the redshift-space correlation function £ s at projected 
separations r p ^ 0.5/i _1 Mpc and relative velocities n ^ 300 km s _1 is very sensitive to the shape 
of A(w, a). Thus, a better measure of t; s at very small scales poses strong constraints on the shape 
of A(u, a) and will improve our understanding of the dynamics of galaxy systems on these scales. 

After the submission of this paper, we learned of Shcth's (1996) independent work on the prob- 
lem of the exponential shape of the PVDF. He investigates a model similar to the model we outline 
in Sect. 2. He performs an accurate comparison of this model with A^-body simulations and shows 
that for initial density perturbations with power-law power spectra the PVDF is well approximated 
by an exponential. He shows that the assumptions which underlie our analytic approach in Sect. 2 
include the relevant physics. 
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obscure statements. We thank an anonymous referee for the clarifying suggestions of a prompt 
report. Invaluable long discussions with Paola Ciarpallini and her inexhaustible support made this 
work possible. We warmly dedicate this work to Paola. This research is supported in part by NASA 
Grant No. NAGW-201 and by the Smithsonian Institution. A.D. was a Center for Astrophysics 
Prc-Doctoral Fellow. 
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APPENDIX 



Here we derive the pairwise velocity difference distributions given in equations (4.1) and (4.2) 
for dissipative systems. 

We assume a self-gravitating gas of particles with an initial Gaussian velocity distribution. The 
particles have internal degrees of freedom; tidal effects increase the particle internal energy during 
particle motion within the system at the expense of the orbital kinetic energy of the particles. In the 
following derivation we ignore particle mass loss and therefore we underestimate the total kinetic 
energy loss as discussed in Sect. 4. 

First we compute the fraction of the relative kinetic energy transferred into the particle internal 
degrees of freedom. We then derive an approximated pairwise velocity difference distribution. 

Suppose that the particles have equal mass m. E is the relative kinetic energy per unit mass 
of two particles in their center of mass reference frame. If v is their relative velocity we have E = 
t> 2 /4 = <j 2 u 2 /2 where a s is the one-dimensional velocity dispersion of the system and u = v/^/2a s . 

Spitzer (1958) considered the encounter between a cloud and a star cluster and computed the 
relative energy transfer to the internal energy of the cluster during the encounter. The derivation 
is similar to the derivation of the energy transfer in a Coulomb collision between a moving charge 
and a harmonically bound charge in the dipole approximation (Jackson 1962). In fact, Spitzer 
assumed that for the stars in the cluster (1) the tidal force of the cloud is small compared with the 
gravitational attraction of the cluster and (2) that the internal gravitational potential of the cluster 
is proportional to the square of the distance from the cluster center, i.e. the stars are harmonic 
oscillators with the same frequency. 

In Spitzer's paper the cloud and cluster have mass m n and m c respectively, r 2 is the mean 
square cluster radius, l/u> the oscillation period of the stars in the cluster, v the cloud-cluster relative 
velocity and p the impact parameter. Spitzer showed that the increase of the cluster internal energy 
after a single encounter is 



where K and K\ are the usual modified Bessel functions, and 9 = 2top/v. 

When v — > oo, L{9) — > 2(1 + 29 2 ) — > 2 and equation (A. la) reduces to the usual impulse 
approximation. When v — > 0, L(9)/v 2 — > tt9 3 cxp(— 29)/v 2 — > 0; the energy loss falls exponentially 
to zero. Weinberg (1994a, b, c) shows that such an adiabatic cutoff is not correct in general. In fact, 
AE approaches zero when lo/v —* oo. However, a stellar system always has stars with arbitrary 
small uj and for those stars we never have w/v — > oo. The perturbation suffered by those stars 
ultimately affects the internal dynamics of the whole stellar system. Thus, assuming AE ~ when 
v — > will underestimate the energy loss. However, we would detect such energy loss only on time 
scales ^ R/v where R is some characteristic size of the system. In other words, we underestimate 
the energy loss only if we observe the system for a sufficiently long time interval. 

We apply equations (A.l) to galaxy systems where the adiabatic approximation does not break 
down because we do not observe them for a long enough time. In fact, either galaxy systems are 




(Ala) 



where G is the gravitational constant and 



L{6) = 29 2 [9 2 K 2 (9) + 9K (9)K 1 {9) + (1 + 9 2 )K 2 (9)} 



(A.lb) 
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dynamically young because they suffer a merging instability and arc therefore intrinsically unstable 
(groups) or their size is large enough that when v is small, R/v is greater than the Hubble time 
(clusters). Therefore, we expect that equations (A.l) approximate the energy loss for galaxies within 
groups and clusters. 

Let us now apply equations (A.l) to a system of equal mass galaxies. All possible galaxy pair 
combinations reduce to a "reduced" galaxy with mass m n = m/2 (the "cloud") which moves with 
velocity v through a field of fixed sources with mass m c — 2m (the "clusters"). The "cloud" loses 
a fraction of its kinetic energy at a rate given by AE times the number of collisions per unit time, 
integrated over the impact parameter p and the mass of the "clusters" m c . In other words, if n(m c ) 
is the number density of "clusters" with mass between m c and m c + dm Cl we have 



dE f 

— = - I AE x v x 2npdp x n(m c )dm 
mlrl (2G\ 



= v x 



2tt (—^j J ~QT~ d0 J m cn(m c )dm c . (A.2a) 



v ) 

We assume that all the "clusters" have the same mass ra c = 2m. Thus, the "cluster" mass 
spectrum per unit volume is n(m c ) = no5(2m — m c ), where 5 is the usual Dirac delta function. 

The quantity 8 m i n depends on the validity of assumption (1) above, namely that the "cloud" 
exerts a tidal force on a "cluster" star which is small compared to the "cluster" attraction. We can 
write 

min = **E*L S *. (A3) 
V2a s u u 

The oscillation period 1/ui of the star within the "cluster" is roughly twice the inverse of the crossing 
time r c j \f3~Oi where G{ is the one-dimensional velocity dispersion within the "cluster" . Therefore 

For p m i n ~ r c and cr, ~ <j s we expect ip — \/6/2 ~ 1.22. 

Applying the recursion formulas dK /d9 = —K\ and dK\jdQ = —K§ — K\/6 the indefinite 
integral over 9 is 

- J ^d9 = 28K a (8)K 1 (8) + Kl{6) = L{8). (A5) 
Finally, equation (A. 2a) reduces to 

f -t ^ !,a 4'© ■ < A2! » 

For the virial theorem Gm = 3afr c . Moreover, u = V3<Ji/2r c and no = 3N/4ttR^ where R s is 
the size of the system and N is the total number of galaxies within the system. We can write 
N = GfMtot/Gm = f(<r s /<Ji) 2 (R s /r c ) where fM to t is the fraction of the total mass concentrated 
in galaxies. The crossing time of the system is t cr — ^/3a s /R s , thus we have 

dE mal 9V6^ f a i\ 6 f r c\ 2 1 ? 



u J Vtt/ 



dt t cr 8 \u s J \ R s 

mal 9V6 . f (tA* ( r c \ 1 ~ f(fi\ 
_/ UJ (r s ^ L \u) 



t. 



cr 



P\l (?) (A.2c) 
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where we have introduced the constant 



9V6 



OS 



P=-^f - • (A.2d) 



The constant (3 contains information about the size and the internal velocity dispersion of galaxies 
compared with those of the whole system. It is apparent that the energy transfer into galaxy internal 
degrees of freedom is mainly sensitive to the ratio of the velocity dispersions. The constant (3 specifies 
when we can ignore the galaxy internal degrees of freedom. We see that within galaxy groups the 
energy transfer must be more clearly detectable than in galaxy clusters. 

The energy transfer rate per unit mass in unit of a 2 s may finally be written 

dt t cr ^ u 3 ~ (u) (6) 

Suppose now that we know the relative kinetic energy distribution q(E)dE at time t. We wish 
to compute the energy distribution X(E)dE at time t + rjt cr , when rj — > 0, so that u ~ const. 

Using equation (A. 6), the relative kinetic energy per unit mass in unit of at the time t + r/t cr 
is, to first order in 77, 

dE 

E(t + rjt cr ) = E(t) + —vtcr 
dt 

= E(t) - »?/3A(it, <p) (A.7a) 

where 



Now, the probability density X(E)dE is 



A(u^) = -L(^). (A.7b) 

u A \uJ 



\{E)dE = q[G- 1 {E)] dG — ^ dE (A8) 
dE 

where the inverse function G _1 is defined through equations (A. 7) 

E = G(E ) = E - vP^(Eo) (A.7c) 

and E(t) = E . 

In the limit r\ — > 0, we have to first order in 77 

G~\E) =E + V (3A(E) (A.9) 

and we may rewrite equation (A. 8) 

-d\og[q(E)} d\og[A(E)Y 



\{E)dE = q(E) { 1 + r]pA(E) 



dE dE 



dE. (AlO) 



Equation (A. 10) is valid only when 77 — > 0, i.e. for times close to t when the distribution q(E) 
is known. We should obtain the distributions X(E, t)dE at different times t by solving the correct 
Boltzmann equation specific for our problem. However, we wish to have an analytic distribution to 
compare with real systems. Therefore, we go further and use equation (A. 10) tout-court assuming 
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a = rj/3 is a free fit parameter. We justify this assumption with the following argument: if q(E) = 
\o(E) at time to and X(E) = Xi(E) at time t\, we again have equation (A. 10) at time i 2 where 
q(E) — > Xi(E) and A(E') — > A 2 (£ ; ). To first order in 77, rj[3 — > 277/?. Thus, if we apply this iterative 
procedure, the form of equation (A. 10) does not change, but now the coefficient a tells us how far 
the system is dynamically from the initial distribution q(E). Moreover, the coefficient a is a measure 
(1) of the time scale of the relative velocity change because of kinetic energy loss (77) and (2) of the 
similarity of the galaxy internal dynamics to the galaxy system dynamics (/3). 
If we assume a Maxwellian q(E) 

q(E)dE ocE 1 / 2 e- E dE (All) 

equation (A. 10) becomes, in terms of the velocity modulus u = v/^/2a s = {2E) 1 / 2 and with the 
explicit expression of A(u) (cq. [A. 7b]), 

\{u)du oc u 2 cxp ( — ^- j [1 — aH(u, Lp)]du (Al2a) 



where 



u A \uJ u A u 6 do 



d9 ] 

The one-component velocity density distribution A(u)du is related to the density distribution 
of the velocity moduli through the equation (Feller 1966) 

\(u)du = -u^P-. (A13) 
du 

Equation (A. 13) holds for any isotropic three-dimensional random field. With the boundary condi- 
tion A(u) — > when u-toowe obtain 



f° 

A(u)du cx / 

J u 



^dt. (A14) 



In other words, 



where 



u 2 



A(u)du cx cxp ( — — ) [1 — aH(u, ip)]du (Al5a) 



H(u,<p) =cx p(y)^ texp(-^jH(t,ip)dt. (A156) 
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